Trees Assembling Mann-Whitney approach for detecting genome-wide joint association among low-marginal-effect loci.
نویسندگان
چکیده
Common complex diseases are likely influenced by the interplay of hundreds, or even thousands, of genetic variants. Converging evidence shows that genetic variants with low marginal effects (LMEs) play an important role in disease development. Despite their potential significance, discovering LME genetic variants and assessing their joint association on high-dimensional data (e.g., genome-wide data) remain a great challenge. To facilitate joint association analysis among a large ensemble of LME genetic variants, we proposed a computationally efficient and powerful approach, which we call Trees Assembling Mann-Whitney (TAMW). Through simulation studies and an empirical data application, we found that TAMW outperformed multifactor dimensionality reduction (MDR) and the likelihood ratio-based Mann-Whitney approach (LRMW) when the underlying complex disease involves multiple LME loci and their interactions. For instance, in a simulation with 20 interacting LME loci, TAMW attained a higher power (power = 0.931) than both MDR (power = 0.599) and LRMW (power = 0.704). In an empirical study of 29 known Crohn's disease (CD) loci, TAMW also identified a stronger joint association with CD than those detected by MDR and LRMW. Finally, we applied TAMW to Wellcome Trust CD GWAS to conduct a genome-wide analysis. The analysis of 459K single nucleotide polymorphisms was completed in 40 hrs using parallel computing, and revealed a joint association predisposing to CD (P-value = 2.763 × 10(-19)). Further analysis of the newly discovered association suggested that 13 genes, such as ATG16L1 and LACC1, may play an important role in CD pathophysiological and etiological processes.
منابع مشابه
A likelihood ratio-based Mann-Whitney approach finds novel replicable joint gene action for type 2 diabetes.
The potential importance of the joint action of genes, whether modeled with or without a statistical interaction term, has long been recognized. However, identifying such action has been a great challenge, especially when millions of genetic markers are involved. We propose a likelihood ratio-based Mann-Whitney test to search for joint gene action either among candidate genes or genome-wide. It...
متن کاملUnveiling the genetic loci for a panicle developmental trait using genome-wide association study in rice
Panicle size has a high correlation with grain yield in rice. There is a bottleneck to identify the additional quantitative trait loci (QTL) for panicle size due to the conventional traits used for QTL mapping. To identify more genetic loci for panicle size, a panicle developmental trait (LNTB, the length from panicle neck-knot to the first primary branch in the rachis) related to panicle size ...
متن کاملThe Pattern of Linkage Disequilibrium in Livestock Genome
Linkage disequilibrium (LD) is bases of genomic selection, genomic marker imputation, marker assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole genome association studies. The Particular alleles at closed loci have a tendency to be co-inherited. In linked loci this pattern leads to association between alleles in population which is known as LD. Two metr...
متن کاملTests for Gene-Environment Interactions and Joint Effects with Exposure Misclassification Running head: GxE Interactions with Exposure Misclassification
The number of methods for genome-wide testing of gene-environment interactions (GEI) continues to increase with the hope of discovering new genetic risk factors and obtaining insight into the disease-gene-environment relationship. The relative performance of these methods based on family-wise type 1 error rate and power depends on underlying disease-gene-environment associations, estimates of w...
متن کاملSpecies delimitation using genome-wide SNP data.
The multispecies coalescent has provided important progress for evolutionary inferences, including increasing the statistical rigor and objectivity of comparisons among competing species delimitation models. However, Bayesian species delimitation methods typically require brute force integration over gene trees via Markov chain Monte Carlo (MCMC), which introduces a large computation burden and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Genetic epidemiology
دوره 37 1 شماره
صفحات -
تاریخ انتشار 2013